Afterlife: A Distributed and Recoverable File System Based on a Unified Approach to Logging

نویسنده

  • Edmond Lau
چکیده

We present a distributed and recoverable file system called Afterlife that only requires a single replicated log to recover from failures in both the file system layer and the storage layer. Based on this unified log, we devise recovery procedures for both file server and block server failures that can run as background processes while the file system continues serving incoming requests. Batch flushing of log records mitigates the performance impact of logging. Our design allows the batch flushing to be integrated with the file server block cache consistency scheme and lazy lock release scheme. Multiple identical block server replicas enable high performance file system read operations and provide availability in the face of failure. We evaluate their performance on our Afterlife prototype of three block servers and two file servers, and show that logging and data replication incurs only a factor of 10 performance versus a conventional NFS file server.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing

In a distributed system using message logging and checkpointing to provide fault tolerance there is always a unique maximum recoverable system state regardless of the message logging protocol used The proof of this relies on the observation that the set of system states that have occurred during any single execution of a system forms a lattice with the sets of consistent and recoverable system ...

متن کامل

Distributed System Fault Tolerance Using Message Logging and Checkpointing

Fault tolerance can allow processes executing in a computer system to survive failures within the system This thesis addresses the theory and practice of transparent fault tolerance methods using message logging and checkpointing in distributed systems A general model for reasoning about the behavior and correctness of these methods is developed and the design implementation and performance of ...

متن کامل

Distr ibuted System

Fault tolerance can allow processes executing in a computer system to survive failures within the system This thesis addresses the theory and practice of transparent fault tolerance methods using message logging and checkpointing in distributed systems A general model for reasoning about the behavior and correctness of these methods is developed and the design implementation and performance of ...

متن کامل

An efficient causal logging scheme for recoverable distributed shared memory systems

This paper presents a causal logging scheme for the lazy release consistent distributed shared memory systems. Causal logging is a very attractive approach to provide the fault tolerance for the distributed systems, since it eliminates the need of stable logging. However, since inter-process dependency must causally be transferred with the normal messages, the excessive message overhead has bee...

متن کامل

Accounting Information System Adoption Based on Self-Assessed Wisdom

Accounting information system can be defined as a processing system of data and economic transactions to provide users information of making decision. The goal of this research is to explore self-assessed wisdom effect considering individual personality traits on accounting information system (AIS) adoption. To have access to this goal Unified Theory of Acceptance and Use of Technology (Venkate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005